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Abstract — This paper proposes a robust reduced-rank scheme 
for adaptive beamforming based on joint iterative optimization 
(JIO) of adaptive filters. The novel scheme is designed according 
to the constant modulus (CM) criterion subject to different 
constraints, and consists of a bank of full-rank adaptive filters 
that forms the transformation matrix, and an adaptive reduced- 
rank filter that operates at the output of the bank of filters to 
estimate the desired signal. We describe the proposed scheme 
for both the direct-form processor (DFP) and the generalized 
sidelobe canceller (GSC) structures. For each structure, we 
derive stochastic gradient (SG) and recursive least squares 
(RLS) algorithms for its adaptive implementation. The Gram- 
Schmidt (GS) technique is applied to the adaptive algorithms 
for reformulating the transformation matrix and improving 
performance. An automatic rank selection technique is developed 
and employed to determine the most adequate rank for the 
derived algorithms. The complexity and convexity analyses are 
carried out. Simulation results show that the proposed algorithms 
outperform the existing full-rank and reduced-rank methods in 
convergence and tracking performance. 



I. Introduction 

Adaptive beamforming techniques have been developed to 
improve the reception of a desired signal while suppressing 
interference at the output of a sensor array. It is an ubiquitous 
task in array signal processing with applications in radar, 
sonar, astronomy, and more recently, in wireless commu- 
nications (T]-|3). A number of adaptive algorithms for the 
beamformer design are available and have been extensively 
studied 0, |4|. The most common are the linearly constrained 
adaptive algorithms ll6l- lfT0l . In general, the linear constraints 
correspond to prior knowledge of certain parameters such as 
the direction of arrival (DOA) of the desired signal. 

An important issue that is considered in adaptive beam- 
forming is the design criterion. Among many adaptive al- 
gorithms found in the literature, the most promising criteria 
employed are the constrained minimum variance (CMV) J5] 
and the constrained constant modulus (CCM) |4) due to 
their simplicity and effectiveness. The CMV criterion aims 
to minimize the beamformer output power while maintaining 
the array response on the DOA of the desired signal. The 
CCM criterion is a positive measure [£0 of the deviation of the 
beamformer output from a constant modulus condition subject 
to a constraint on the array response of the desired signal. 
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By measuring the deviation, the CCM criterion provides more 
information than the CMV for the parameter estimation of 
constant modulus constellations in the beamformer design. 

Numerous adaptive algorithms have been proposed ac- 
cording to the constrained version criteria to realize the 
beamformer design ll6l- lfl"Tl . The major drawback of 

the full-rank methods, such as stochastic gradient (SG) lfl2l 
and recursive least-squares (RLS) ifPTl . 04], is that these 
methods require a large amount of samples to reach the 
steady-state when the number of elements in the filter is 
large. Furthermore, in dynamic scenarios, filters with many 
elements usually fail or provide poor performance in tracking 
signals embedded in interference and noise. Reduced-rank 
signal processing was originally motivated to provide a way 
out of this dilemma lfl8l - ll24l . For the application of beam- 
forming, reduced-rank schemes project the received vector 
onto a lower dimensional subspace and perform the filter 
optimization within this subspace. One of the popular reduced- 
rank schemes is the multistage Wiener filter (MSWF), which 
employs the minimum mean squared error (MMSE) (26 1, and 
its extended versions that utilize the CMV and CCM criteria 
were reported in 11271 . 11281 . Another technique that resembles 
the MSWF 129), EOl is the auxiliary-vector filtering (AVF) 
BP . [32]. A joint iterative optimization (JIO) scheme, which 
was presented recently in 11371 . employs the CMV criterion 
with a relative low-complexity adaptive implementation to 
achieve better performance than the existing methods. 

In this paper, we introduce a robust reduced-rank scheme 
based on joint iterative optimization of filters with the CCM 
criterion in detail and compare it with that of the CMV to 
show its improved performance in the studied scenarios. The 
developed CCM reduced-rank scheme consists of a bank of 
full-rank adaptive filters, which constitutes the transformation 
matrix, and an adaptive reduced-rank filter that operates at 
the output of the bank of full-rank filters. The transformation 
matrix maps the received signal into a lower dimension, 
which is then processed by the reduced-rank filter to estimate 
the transmitted signal. The proposed scheme provides an 
iterative exchange of information between the transformation 
matrix and the reduced-rank filter and thus leads to improved 
convergence and tracking performance. 

This paper makes two contributions: 

• A reduced-rank scheme according to the constant mod- 
ulus (CM) criterion subject to different constraints is 
proposed based on the JIO of adaptive filters. This robust 
reduced-rank scheme is investigated for both direct-form 
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processor (DFP) and the generalized sidelobe canceller 
(GSC) (14\ structures. For each structure, a family of 
computationally efficient reduced-rank SG and RLS type 
algorithms are derived for the proposed scheme. The 
Gram-Schmidt (GS) technique is employed in the pro- 
posed algorithms to reformulate the transformation matrix 
for further improving performance. An automatic rank 
selection technique is developed to determine the most 
adequate rank for the proposed algorithms. 
• The complexity comparison is presented to show the 
computational costs of the proposed reduced-rank algo- 
rithms. An analysis of the convergence properties for the 
proposed reduced-rank scheme is carried out. Simulations 
are performed to show the improved convergence and 
tracking performance of the proposed algorithms over ex- 
isting methods. The effectiveness of the GS and automatic 
rank selection techniques for the proposed algorithms is 
visible in the results. 

The remainder of this paper is organized as follows: we out- 
line a system model for beamforming in Section II. Based on 
this model, the full-rank and reduced-rank CCM beamformer 
designs are reviewed. The proposed reduced-rank scheme 
based on the CM criterion subject to different constraints is 
presented in Section III, and the proposed adaptive algorithms 
are detailed for implementation in Section IV. The complexity 
and convergence analysis of the proposed algorithms is carried 
out in Section V. Simulation results are provided and discussed 
in Section VI, and conclusions are drawn in Section VII. 



B. Full-rank CCM Beamformer Design 

The full-rank CCM linear receiver design for beam- 
forming is equivalent to determining a filter w(i) = 
[wi(i), . . . , w m (i)] T G C mxl that provides an estimate of 
the desired symbol y(i) = w H (i)x(i), where {-) H denotes 
Hermitian transpose. The calculation of the weight vector is 
based on the minimization of the following cost function: 

J cm (w(i)) =E{[|i/(<)| p -i/] 2 } ) 
subject to w H (i)a(6o) = 7, 

where v is suitably chosen to guarantee that the weight 
solution is close to the global minimum and 7 is set to ensure 
the convexity of ((2j ||28l . The quantity 9 is the direction of 
the desired signal, a(9 ) denotes the corresponding normalized 
steering vector, and in general, p = 2 is selected to consider the 
cost function as the expected deviation of the squared modulus 
of the beamformer output to a constant, say v = 1. The 
CCM criterion is a positive measure H of the deviation of the 
beamformer output from a constant modulus condition subject 
to a constraint on the array response of the desired signal. 
Compared with the CMV criterion, it exploits a constant mod- 
ulus property of the transmitted signals, utilizes the deviation 
to provide more information for the parameter estimation of 
the constant modulus constellations, and achieves a superior 
performance |[T5l , ||28l . The CCM beamformer minimizes the 
contribution of interference and noise while maintaining the 
gain along the look direction to be constant. The weight 
expression of the full -rank CCM design is given in ll28l . 



II. System Model and CCM Beamformer Design 

In this section, we first describe a system model to express 
the received data vector. Based on this model, the full-rank 
beamformer design according to the CM criterion subject to 
the constraint on the array response is introduced for the DFP 
and the GSC structures. 



A. System Model 

Let us suppose that q narrowband signals impinge on a 
uniform linear array (ULA) of m (m > q) sensor elements. 
The sources are assumed to be in the far field with DOAs 
9o,...,6 q -i. The received vector x(i) <E C mxl at the ith 
snapshot can be modeled as 

x(i) = A(6)s(i) + n(i), i = l,...,N, (1) 

where 9 = [6 , . . . , 9 q - 1 ] T G R qxl is the signal DOAs, 
A{6) = [a(9 ),...,a(6 q -i)] G C mxq comprises the nor- 
malized signal steering vectors a(9k) = [1, e~ 27r -' 1 ^ cos6 ' fc , . . ., 
e -a*i(m-i3£c«w9 fc ]T e C mxi ; (ft = , . . . , 5 - 1), where A c 
is the wavelength and u (u = A c /2 in general) is the inter- 
element distance of the ULA, and to avoid mathematical am- 
biguities, the steering vectors a(9k) are assumed to be linearly 
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independent. s(i) G C 9 is the source data, n(i) G 
temporary white sensor noise, which is assumed to be a zero- 
mean spatially and Gaussian process, N is the observation size 
of snapshots, and ( ) T stands for transpose. 



C. Reduced-rank CCM Beamformer Design 

For large m, considering the high computational cost and 
poor performance associated with the full-rank filter, a number 
of recent works in the literature have been reported based 
on reduced-rank schemes lfl8l - ll22l . jl26) - ll46) . Here, we will 
describe a reduced-rank framework that reduces the number 
of coefficients by mapping the received vector into a lower 
dimensional subspace. The diagrams of the reduced-rank 
processors are depicted for the DFP and the GSC structures 
in Fig. [TJa) and Fig. Gib), respectively. 
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Fig. 1. Reduced-rank scheme for (a) the DFP and (b) the GSC structures. 

1) Beamformer Design for the DFP : In the DFP structure, 
T r £ C n x r denotes the transformation matrix that includes a 
set of m x 1 vectors for a r-dimensional subspace with r < m. 
The transformation matrix maps the received vector x(i) into 
its low-dimension version x(i) G C rxl , which is given by 



x(i)=T?(i)x{i), 



(3) 



where, in what follows, all r-dimensional quantities are de- 
noted by an over bar. An adaptive reduced-rank CCM filter 
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w(i) G C rxl follows the transformation matrix to produce 
the filter output y(i) = w H (i)x(i). 

Substituting the expression of y(i) into the cost function 
in (O and calculating for the reduced-rank weight vector, we 
have ED 

wh + 1) = R M<p(i)-- = — = >, 

(4) 

where R(i) = E[\y(i)\ 2 T? (i)x(i)x H (i)T r {i)} G C rxr , 
a(6 ) = T?a{9 ) G C rxl , and p(i) = E[y*{i)T? (*>(*)] £ 
C rxl . Note that the expression in (0]i is a function of previous 
values of w{i) (since y(i) = w H (i)x(i)) and thus must be 
initialized to start the computation for the solution. We keep 
the time index in R(i) and p(i) for the same reason. 

2) Beamformer Design for the GSC: The GSC structure 
converts the constrained optimization problem into an un- 
constrained one and adopts an alternative way to realize the 
beamformer design. The full-rank CCM beamformer design 
with respect to the GSC structure has been reported in ll42l . 
Here, we employ an alternative way proposed in ||47l , [|48l 
to describe a reduced-rank GSC structure. As can be seen in 
Fig. 01b), the reduced-rank GSC structure composes a con- 
strained component (a 7 ($o) = 7 a ($o)) an d an unconstrained 
component. x(i) is a new received vector defined as 

= ygscW^W' ( 5 ) 

where y gsc (i) = w H (i)x(i). The definition of x(i) is valid 
for p = 2 in (O and |j/ gS c(i)| 2 = w H (i)x(i). This expression 
is only to favor its use in the GSC structure for the case 
of the CM cost function. Note that y gsc (i) and y(i) (full- 
rank or reduced-rank with T r = I mxm ) correspond to the 
same values but are written in a different way to indicate the 
structures (DFP and GSC). 

For the constrained component, the output is do(i) = 
a,H \6q)x{i). With respect to the unconstrained component, 
the new received vector passes through a signal blocking 
matrix B g ct™- 1 )*™ to get a transformed vector x B (i) G 
CO™" 1 )* 1 , which is 

x B (i) = Bx(i), (6) 

where B is obtained by the singular value decomposition 
or the QR decomposition algorithms P4l . Thus, Ba(9^) = 
0( m -i)xi means that the term B effectively blocks any signal 
coming from the look direction 9q. The transformation matrix 
Tgsc(i) g C( m ~ 1 ) xr maps the transformed vector x B (i) into 
a low-dimension version, as described by 

x B (i)=T* c (i)x B (i). (7) 

The reduced-rank received vector x B (i) is processed by a 
reduced-rank filter u> gsc (i) G C rxl to get the unconstrained 
output yo(i) = w gsc (i)x B (i). The reduced-rank weight vector 

is EE) 

M> gBC (i + 1) = R^ B (i)p B (i), (8) 

where R Sb (i) = E[T^ c (t)x B (i)x^{i)T gS0 (t)} G C rxr and 
p B {i) = E[(dj5(i) - l)Tf sc (i)x B (i)} G C rxl . Note that this 
expression is a function of previous values of the weight vector 



and therefore must be initialized to start the computation for 
the solution. 

The reduced-rank GSC structure can be concluded in a 
transformation operator S = [a 7 (0 o ), B H T gsc ] H G C (r+1) x 
m and a reduced-rank weight vector w' = [1, — w^ sc ] H G 
£>0+i)xi rp ne e q U j va j ent full-rank weight vector can be 
expressed as 

w(i + l) = S H w'(i + l) 

= a 7 (9 Q ) - B H T gsc (i + l)w gsc (i + 1). 

The reduced-rank weight expressions in (0) for the DFP and 
in (O for the GSC are general forms to the signal processing 
tasks. Specifically, for r = in (DFP) and r = m — 1 (GSC), the 
expressions are equivalent to the full-rank filtering schemes 
Ifltl . For 1 < r < m (DFP) and 1 < r < m - 1 (GSC), 
the signal processing tasks are changed and the reduced-rank 
filters estimate the desired signals. 

The challenge left to us is how to efficiently design and cal- 
culate the transformation matrices T r and T gsc . The principal 
components (PC) method reported in ITT81 uses the eigenvec- 
tors of the interference-only covariance matrix corresponding 
to the eigenvalues of significant magnitude to construct the 
transformation matrix. The cross-spectral (CS) method lETl . 
a counterpart of the PC method belonging to the eigen- 
decomposition family, forms the transformation matrix by 
using the eigenvectors which contribute the most towards max- 
imizing the SINR and outperforms the PC method. Another 
family of adaptive reduced-rank filters such as the MSWF ||26) . 
Il27l and the AVF [31] generates a set of basis vectors as the 
transformation matrix that spans the same Krylov subspace 

ED, ED. 

III. Proposed CCM reduced-rank scheme 

In this section, we introduce the proposed reduced-rank 
scheme based on the JIO approach. Two optimization prob- 
lems according to the CM criterion subject to different con- 
straints are described for the proposed scheme. Based on 
this scheme, we derive the expressions of the transformation 
matrix and the reduced-rank weight vector. For the sake of 
completeness, the proposed scheme is realized for both the 
DFP and the GSC structures. 

A. Proposed CCM Reduced-rank Scheme for the DFP 

Here we detail the principles of the proposed CCM reduced- 
rank scheme using a transformation based on adaptive fil- 
ters. For the DFP structure depicted in Fig. 0a), the pro- 
posed scheme employs a transformation matrix T r (i) G 
C mXf , which is responsible for the dimensionality reduc- 
tion, to generate x(i) G C rxl . The dimension is reduced 
and the key features of the original signal is retained 
in x(i) according to the CCM criterion. The transforma- 
tion matrix is structured as a bank of r full-rank filters 

= [*lj(»).*2j(»), • ■ • ,tm,M T e Cm><1 < U = !,•••,»•) 

as given by T r (i) = [ti (i) , t2 («),-•- , t r (*)] ■ An adaptive 
reduced-rank filter w(i) G C rxl is then used to produce 
the output. The transformation matrix T r (i) and the reduced- 
rank filter w(i) are jointly optimized in the proposed scheme. 
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The filter output is a function of the received vector, the 
transformation matrix, and the reduced-rank weight vector, 
which is 
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Fig. 2. Proposed reduced-rank scheme for (a) the DFP and (b) the GSC 
structures. 



y(i) = w H {i)T? (i)x(i) = w H (i)x(i). 



(10) 



We describe two optimization problems according to the 
CM cost function subject to different constraints for the 
proposed reduced-rank scheme, which are given by 

Problem 1 : min J cm (T r (i), w(i)) = E^[\y(i)\ 2 - lfj 
subject to w H {i)Tr (i)a(0 o ) = 7, 

(11) 

Problem 2 : min J cm (T r (£), w(i)) = E^[\y(i)\ 2 - l] 2 | 

subject to w H \i)Tf (i)a(9 ) = 7 and T^{i)T r {i) = I. 

(12) 

Compared with ( fTTT i. the problem in (TTZb has an orthogonal 
constraint on the transformation matrix, which is to refor- 
mulate T r (i). The transformation matrix generated from 
(fTTT i has vectors that may perform a similar operation (e.g., 
take the same information twice or more), thereby making 
poor use of the data and losing performance. The subspace 
computed with (fT2l . which spans the same subspace as T r (i), 
generates basis vectors that are orthogonal to each other and 
which does not affect the noise statistically. The reformulated 
transformation matrix performs an efficient operation to keep 
all useful information in the generated reduced-rank received 
vector, which is important to estimate the desired signal and 
improve the performance. In the following, we will derive the 
CCM expressions of T r (i) and w(i) for solving (fTTT i and (fT2l . 

The cost function in ( fTTT > can be transformed by the method 
of Lagrange multipliers into an unconstrained one, which is 



\(i)Mi)) =®{[\w H (i)T?(i)x(i)\ 2 -l] 2 } 
+ 29i{\[w H (i)T?(i)a(6 )- 1 ]}, 



(13) 



where A is a scalar Lagrange multiplier and the operator 
selects the real part of the argument. 

Assuming w(i) is known, computing the gradient of dT~3l > 
with respect to T r {i) (matrix calculus), equating it to a null 
matrix and solving for A, we have 

T r {i + 1) = fl-^ojpWia^Ci)- 

(14) 



w H (i)R a \i)w(t)a H (e )R- 1 (i)a(e Q ) 



where p(i) = E[y*(i)x(i)} G C mxl ,_ R(i) = 

E[\y{i)\ 2 x{i)x H {i)] G C mxrn^ an( j 

E[w(i)w H (i)] e C rxr . Note that the reduced-rank weight 
vector w(i) depends on the received vectors that are random 
in practice, thus Rw(i) is r-rank and invertible. R(i) and 
p(i) are functions of previous values of T r (i) and w(i) due 
to the presence of y(i). Therefore, it is necessary to initialize 
T r (i) and w(i) to estimate R(i) and p(i), and start the 
computation. 

On the other hand, assuming T r (i) is known, computing 
the gradient of ( TT3l with respect to w(i), equating it to a null 
vector, and solving for A, we obtain 



w(i + l) = R 1 (i){p(i 



d ff (e o )fl _1 (i)a(0o) 



(15) 



where R(i) = E[\y(i)\ 2 T? (i)x(i)x H (i)T r (i)] G C rxr , 
p(i) = E[y*(i)T*(i)x(i)] G C rxl , and a(6 ) = 
T?(i)a(6 ). 

Note that the expressions in (TBI for the transformation 
matrix and $15[ for the reduced-rank weight vector can be 
applied to solve the optimization problem (fT2l . The orthogonal 
constraint in ( TTZb can be realized by the Gram-Schmidt (GS) 
technique, which will be illustrated in the next section. 



B. Proposed CCM Reduced-rank Scheme for the GSC 

For the GSC structure, as depicted in Fig. |3b), the proposed 
scheme utilizes a transformation matrix T gsc (i) G C^ m_1 ^ xr 
to map the new transformed vector xb(i) G c' m_1 ^ xl into a 
lower dimension, say Xs{i) = T^ sc (i)xB(i) G C rxl . In our 
design, the transformation matrix T gsc (i) and the reduced-rank 
weight vector w ssc (i) for the sidelobe of the GSC are jointly 
optimized by minimizing the cost function 

"^crn-gsc (^gsc (^) j ^gsc (O) — 

E{ [(a 7 (9 Q ) - B H T ssc (i)w gsc (i)) H x(i) - l] 2 }, 

(16) 

where the expression in ([TBI for the GSC is obtained by 
substituting <(5j and (0 into (O with p = 2. This is an 
unconstrained cost function that corresponds to (fTTV From Fig. 
12(b), this structure essentially decomposes the adaptive weight 
vector into constrained (array response) and unconstrained 
components (see also Eq. (O). The unconstrained component 
can be adjusted to meet the CM criterion since the constrained 
component always ensures that the constrained condition is 
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satisfied. Thus, the proposed GSC framework converts the 
constrained optimization problem into an unconstrained one. 

Assuming w gsc (i) and T gsc (i) are given, respectively, com- 
puting the gradient of (TToT l with respect to T gsc (i) and w gsc (i), 
and solving the equations yields 

T gsc (* + 1) = ^(i)p B (i)<(i)^(i), (17) 

w ssc (i + l) = R^l(i)p B (i), (18) 

where Rj. B {i) and p B (i) have been defined in the pre- 
vious section, and R^^) = K[w gic (i)w^ sc (i)] e C rxr . 
-R«w(*) is invertible since w gsc (i) depends on the ran- 
dom received vector and R^^^i) is a full-rank matrix. 

R Sb (i) = E[Tf sc (i)x B (i)x§(i)T gsc (i)} e C rxr and 

p B (i) = E[(dg(t) - l)T*(i)sB B (*)] e C rxl . Again, the 
orthogonal constraint on the transformation matrix can be en- 
forced in the optimization problem (TToT l and the GS technique 
is employed to realize this. 

Note that the filter expressions in (fl4l i and ( fT5t for the DFP 
and (TTTb and ( TT8l for the GSC are not closed-form solutions. In 
the DFP structure, the expression of the transformation matrix 
in ( TT~4T > is a function of w(i) and the reduced-rank weight 
vector obtained from (fT5t depends on T r (i). It is necessary to 
set initial values of T r (i) and w(i) for the update procedures. 
Thus, initialization about the transformation matrix and the 
reduced-rank weight vector is not only to get a beamformer 
output y(i) for estimating R(i) and R(i), but to start the 
computation of the proposed scheme. In the case of the GSC, 
we initialize T gsc (i) and w gsc (i) with the same intention. 

Unlike the MSWF J26) and the AVF EB techniques in 
which the transformation matrix is computed independently 
from the reduced-rank filter, the proposed scheme provides an 
iterative exchange of information between the transformation 
matrix and the reduced-rank filter, which leads to improved 
convergence and tracking performance. The transformation 
matrix reduces the dimension of the received vector whereas 
the reduced-rank filter attempts to estimate the desired signal. 
The key strategy lies in the joint iterative optimization of the 
filters. In the next section, we will derive iterative solutions via 
simple adaptive algorithms and introduce an automatic rank 
selection technique for the adaptation of the rank r. 

IV. Adaptive Algorithms of The Proposed CCM 
Reduced-rank Scheme 

We derive SG and RLS type algorithms for the proposed 
CCM reduced-rank scheme. Some related works can be found 
in ifTTI - fPSl . In this paper, the adaptive algorithms are 
described for the DFP and the GSC structures, respectively, 
to perform joint iterative updates of the transformation matrix 
and the reduced-rank weight vector. They are used to solve 
Problem 1. The Gram-Schmidt (GS) technique is employed 
in these algorithms and imposes an orthogonal constraint on 
the transformation matrix to solve Problem 2. An automatic 
rank selection technique is introduced to determine the most 
adequate rank for the proposed methods. 



TABLE I 

THE JIO-CCM-SG ALGORITHM FOR DFP 



Initialization: 

T r (l) = [Irxr rx ( m _ r )] ; 

w(l) = If (l)a 7 (0 o )/(||T? (l)a 7 (0 o )|| 2 ). 
Update for each time instant i 

y(i) = w H (i)Jf (»)*(»); e(i) = \y(i)\ 2 - 1 

T r (i + 1) = T r (i) - n TT e{i)y*(i) [I - a{6 )a H {6 )]x(i)w H (i) 

y(i) = w H {i)T^{i + l)x(i); e(i) = \y(i)\ 2 - 1 

a(9 ) = if (i + l)a(0 o ), x(i) = if (i + l)x(i) 

w(i + 1) = gffl - ^e(i)y*(i) [I - ffi> n s S ] *W 



A. Stochastic Gradient Algorithms 

Here, we derive the SG algorithms with the proposed CCM 
reduced-rank scheme for both the DFP and the GSC structures. 

1) SG algorithm for the DFP: Assuming w(i) and T r (i) 
are known, respectively, computing the instantaneous gradient 
of < fT~3T > with respect to T r (i) and w(i), we obtain 

VL cmr (i) (i) = 2e{i)y*(i)x(i)w H (i) + 2\* Tr a{0 )w H (i), 

(19) 

VL cmffi(i) (i) = 2e(i)y*(i)T? (i)x(i) + 2Allf (i)a(0 o )> 

(20) 

where e(i) = \y{i)\ 2 — 1. 

Following the gradient rules T r (i + 1) = T r (i) — 
/T r VJ lm (i) and w(i + 1) = w(i) - (J,wVJun m(i) (i), sub- 
stituting ( fl9l and (120b into them, respectively, and solving the 
Lagrange multipliers At,, and by employing the constraint 
in ( fTTT i. we obtain the iterative SG algorithm for the DFP, 
which is denominated as JIO-CCM-SG. A summary of this 
algorithm is given in Table [I] where fiT r and are the 
corresponding step size factors for the DFP, which are small 
positive values. The initialization values are set to satisfy 
the constraint in (fTTT i. The transformation matrix T r (i) and 
the reduced-rank weight vector w(i) operate together and 
exchange information at each time instant. 

2) SG algorithm for the GSC: For the GSC structure, 
assuming w gsc (i) and T gsc (i) are given in ([Tol l, respectively, 
we get 

VJcm- gS c TgscW (i) = e* gsc (i)x B (i)w£ c (i), (21) 

VJ cm - gsCiiWi) (i) = e* gsc (i)x B (i), (22) 

where e esc (i) = 1 — w H (i)x(i) and w(i) is obtained from <j9j . 

Substituting (fJTJ and (l22l into the gradient rules, we obtain 
the iterative SG algorithm for the GSC, which is summarized 
in Table []]] where fir and fiw are the corresponding step 
size factors for the GSC. 



B. Recursive Least Squares Algorithms 

In this part, we derive the RLS algorithms with the proposed 
CCM reduced-rank scheme for both the DFP and the GSC 
structures. 
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TABLE II 

THE JIO-CCM-SG ALGORITHM FOR GSC 



TABLE III 

THE JIO-CCM-RLS ALGORITHM FOR DFP 



Initialization: 

Tgsc(l) = [Irxr OrX(m— r)J I *Wgsc(l) ~ Irxl- 

Update for each time instant i 

w(i) = a 7 (0o) - B H T gsc (i)w gsc (i), y gS c(i) = w H (i)x(i) 

= y g A i ) x ( i )' x B (i) = Bx(i), e gsc (i) = 1 - w H (i)x(i) 
T gsc (i + 1) = T gsc (i) - VT r e gsc {i)x B (i)w gsc (i) 
w(i) = a 7 (6» ) - B H T gsc (i + l)wv(i) 
2/ g8 c(«) = w H (i)x(i), x(i) = y* sc (i)x(i), x B {i) = Bx(i) 
e gsc (i) = 1 - (i)S(i), x B (i) = T^ c (i + l)sB B (i) 



c (z + 1) = Wg S c(») - fJ-w^el^^xsii) 



1) RLS algorithm for the DFP: Considering the DFP case, 
the unconstrained least squares (LS) cost function is given by 



L m (T r (i),w(i))=J2^- l [\w H (i)Tf(i) X (l)\ 2 -lY 
i=i 

+ 2m{\[w H (i)T?(i)a(6 )-7]}, 



(23) 



where a is a forgetting factor chosen as a positive constant 
close to, but less than 1. 

Assuming w(i) and T r (i) are known in d23l . respectively, 
we obtain 



2V(i + 1) = 



R 



l (i){p(i) 



[p H (i)R (i)a(e )--f]a(e ) X w H (i) 



a H (6 )R (i)a(flo) 



}- 



(24) 



[^( l )H- 1 ( i )a(0 o ) 



TOW 



(25) 



where 

EU a l - l \y(l)fx(l)x H (l), p(i) = £j =1 o*-y (0x(0, and 
p(i) = E;=i al_ 'y*(0 :E (0 w i m 2/(0 expressed in ( fTOi i. The 
derivation of (124-b is given in the appendix. Note that R(i) is 
not invertible if i < m. It can be implemented by employing 
the diagonal loading technique Q, 101 . This same procedure 
is also used for the remaining matrices. 

To avoid the matrix inversion and reduce the complexity, 
we employ the matrix inversion lemma lfT4l to update R (i) 
and R(i) iteratively. The resulting adaptive algorithm, which 
we denominate as JIO-CCM-RLS, is summarized in Table 
UTTl where <&(i) = R (i) and 3>(i) = are defined 

for concise presentation, k(i) G C mx and k(i) G C rxl 
are the full-rank and reduced-rank gain vectors, respectively. 
The recursive procedures are implemented by initializing 
<&(0) = SI mxm and <l>(0) = 5I rxr , where 8 and 8 are 
positive scalars. 



Initialization: 

2"V(1) = [-frxr rx ( m _ r )] ; 

w(l) = T» (l) ai (e )/(\\T? (l)a 7 (0 o )|| 2 ); 

*(0) = 5I mxm ,i(0) = SI rxr ,p{0) = O mx i,p(0) = O rx i. 
Update for each time instant i 

y(t) = w H (i)T?(i)x(i), p(i) = ap{i - 1) + y*(i)x(i) 

U(A\ — q~ 1 »(i-l)a!(i) 

CVIv(*)l a )+a _1 «' H (i)*(i-l)x(i) 

*(*) = oT 1 *^ - 1) - a -1 ^i)a; H (i)*(* - 1) 

= io H (i)Tf (j + l)x(i), a(0 o ) = Tf (j + l)a(6»„) 
sc(i) = T"(i + l)x(i), p(i) = ap(i - 1) + y*{i)x(i) 

Q!~ 1 |'(i-l)^(i) 



fc(i) 



(VIl/WI )+« * (i)#(i-l)S(») 
Hi 



= - 1) - a -J -fc(i)sc H (i)$(i - 1) 

-/• , ii s.^r-/-\ [p H ( i )*( i )"(9o)-7]a(e )-| 

toll + 1) = &(l)\p(l) — 1 - z \ 



2 J /?LS algorithm for the GSC: For the GSC structure, the 
LS cost function is given by 



Ain-gscV- 1 - g: 



(T gsc (i), lOgscW) 



2 

53a < - , {[a y (fl )-B H r glc (i)ti> gIC (i)] H i(0 - l} • 
i=i 

(26) 

Assuming the optimal reduced-rank weight vector w gsc and 
the transformation matrix T gsc (i) are known, respectively, 
computing the gradients of (l26l l with respect to T gsc (i) and 
w gsc (i), and equating them equal to null, we have 

r pB (< + i) = j^(0p fl (0p^j 3 , (27) 

to pc (i+l) = ^(*)p B (i) J (28) 



where ife fl (i) = Ej=i« i_ 'Bx(0» (0- 5 . BxbW = 
}, T^ sc (i)R iB (i)T gsc (i), PB (i) = EU« i_ '[do(0 - WO. 



and p B (i)=T^ c (i)p B (i_ i 

Setting *£ B (i) = flg B (i) and $ 2f3 (i) = and 
employing the matrix inversion lemma yields, 

T gsc {i + 1) = T gsc (i) - ^(^(i), (29) 
it> gS c(i + 1) = w g sc(i) - e gsc (i)fc_B(i): (30) 
where fe B (i) G C( m_1 ) xl and fc B G C rxl are gain vectors, 

and u>(i) is defined by (0. A summary of the reduced-rank 
RLS algorithm with the CCM design for the GSC is given in 
Table [TV] 

C. Gram-Schmidt Technique for Problem 2 

As mentioned before, the transformation matrix T r (i + 1) 
for the DFP is constituted by a bank of full-rank filters 
tj(i + 1) (j — 1, ...,r), which cannot be guaranteed to 
be orthogonal. According to the optimization problem 2 in 
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TABLE IV 

THE JIO-CCM-RLS ALGORITHM FOR GSC 



Initialization: 

Tgsc(l) = [Irxr O r x(m— r)] i U'gsc(l) = -frxl 
*i B (0) = 5I mxm , (0) = SI rxr . 

Update for each time instant i 

w(i) = a 7 (6» ) - B H T gic {i)w gx {i), y gx {i) = w H (i)x(i) 
= Vtsc('i) x ('i), x B {i) = Bx(i), e gsc (i) = 1 — w H (i)x(i 

, f\ _ ""^xg (»-l)a B (i) 

S '*' _ l + a- 1 Sg(i)*i B (i-l)aB(i) 

*i B («) = a _1 *i B (i - 1) - a~ 1 k B (i)x B (i)*s B (i - 1) 

e^(i) = [1 - i^lip 
T gsc (i + 1) = T gsc (i) - k B (i)e WeK (i) 
x B (i) =Tg c (i + l)x B (i) 

K B (l) — „ a 

V ' 1+c-lfif (i)*i B (i-l)i B (i) 

*s B (i) = a" 1 *^ - 1) - Q-^s^xf (z)** B (i - 1) 

= a 7 (0o) - B H T gsc (i + l)w gsc (i) 
e gS c(i) = 1 — w H (i)x(i) 

w gx (i + 1) = M)gsc(i) - e* gx (i)k B (i) 



112) . the transformation matrix T r (z) can be reformulated to 
compose r orthogonal vectors, which span the same subspace 
generated by the original vectors. The reformulation ensures 
that the projection of the received vector onto each dimension 
is one time and avoids the overlap (e.g., takes the same 
information twice or more). Compared with the original trans- 
formation matrix, the reformulated transformation matrix is 
more efficient to keep the useful information in the generated 
reduced-rank received vector for the parameter estimation. The 
orthogonal procedure is realized by the Gram-Schmidt (GS) 
technique 1491 . Specifically, after the iterative processes for the 
computation of the transformation matrix, the GS technique is 
performed to modify the columns of the transformation matrix 
as follows: 

j'-i 

tj, 0I t(i + 1) = tj(i + 1) - ^proj t; ort(l+1) ^(i + 1), (31) 

;=i 

where tj i0rt (z + l) is the normalized orthogonal vector after the 
GS process. The projection operator is proj t; t u+i)tj(i+l) = 

[*f«t(» + l)*i,<*(i + l)]" 1 + + WiM* + !)• 

The reformulated transformation matrix T r olt (z + 1) is 

constructed after we obtain a set of orthogonal ij, ,t(i + l). By 

employing T r>oR (i + 1) to compute the reduced-rank weight 

vectors, the adaptive algorithms could achieve an improved 

performance. Following the same procedures, we can also 

apply the GS technique to the adaptive algorithms for the GSC 

structure. Simulations will be given to show this result. We 

denominate the GS version of the SG and RLS algorithms as 

JIO-CCM-GS and JIO-CCM-RGS, respectively. 

D. Automatic Rank Selection 

The selection of the rank r impacts the performance of 
the proposed reduced-rank algorithms. Here, we introduce an 
adaptive method for selecting the rank. Related works on the 
rank selection for the MSWF and the AVF techniques have 
been reported in ||271 and ll50l . respectively. Unlike these meth- 
ods, we describe a rank selection method based on the CM 



criterion computed by the filters T^ '(i) and w^ r \i), where 
the superscript (•)' r - ) denotes the rank used for the adaptation 
at each time instant. We consider the rank adaptation technique 
for both the DFP and the GSC structures. Specifically, in 
the DFP structure, the rank is automatically selected for the 
proposed algorithms based on the exponentially-weighted a 
posteriori least-squares cost function according to the CM 
criterion, which is 

Jpc m (TW(i-i),™«(i-i)) = 

j2Q l - l [\w (r)H {l - 1)TW(1 - l)x(l)\ 2 l] 2 , 



i=i 



(32) 



where q is the exponential weight factor that is required as 
the optimal rank r can change as a function of the time 
instant i. From the expressions in Table [fland Table Hill the 
key quantities to be updated for the rank adaptation are the 
transformation matrix T r (i), the reduced-rank weight vector 
w(i), the associated reduced-rank steering vector a(#o) an d 
the matrix 3>(i) (for RLS only). To this end, we express 
T^'ii) and w^ r \i) for the rank adaptation as follows: 



*i,i 

*2,1 



tl.2 



(33) 



>(i)=[ Wl W 2 ... W rmin ... UV m J , 

where r m i n and r max are the minimum and maximum ranks 
allowed, respectively. 

For each time instant i, T^\i) and w^'\i) are updated 
along with the associated quantities a(9o) and 3>(i) for a 
selected r according to the minimization of the cost function in 
(|32| >. The developed automatic rank selection method is given 
by 

(TW(i-l),#(i-l)), 



' opt 



arg mm 



J\ D( 



(34) 



where j is an integer ranging between r m i„ and r max . Note that 
a smaller rank may provide faster adaptation during the initial 
stages of the estimation procedure and a slightly larger rank 
tends to yield a better steady-state performance. Our studies 
reveal that the range for which the rank r of the proposed 
algorithms have a positive impact on the performance is very 
limited, being from r m ; n = 3 to r max = 7. These values are 
rather insensitive to the number of users in the system, to 
the number of sensor elements, and work efficiently for the 
studied scenarios. The additional complexity of this automatic 
rank selection technique is for the update of involved quantities 
with the maximum allowed rank r max and the computation of 
the cost function in ( f32b . With the case of large m, the rank r 
is significantly smaller than m and the additional computations 
do not increase the computational cost significantly. 

The proposed algorithms with the rank adaptation technique 
can increase the convergence rate and improve the output 
performance, and r can be made fixed once the algorithms 
reach the steady-state. Simulation results will show how the 
developed rank adaptation technique works. Note that the same 
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idea can be employed in the algorithms for the GSC structure. 
We omit this part for simplification and readability. 

V. Analysis of the proposed algorithms 

In this section, we provide a complexity analysis of the 
proposed reduced-rank algorithms and compare them with 
existing algorithms. An analysis of the optimization problem 
for the proposed reduced-rank scheme is also carried out. 

A. Complexity Analysis 

We evaluate the computational complexity of the proposed 
reduced-rank algorithms and compare them with the existing 
full-rank and reduced-rank algorithms based on the MSWF 
and the AVF techniques for the DFP and the GSC structures. 
With respect to each algorithm, we consider the CMV and 
the CCM design criteria. The computational requirements are 
described in terms of the number of complex arithmetic oper- 
ations, namely, additions and multiplications. The complexity 
of the proposed and existing algorithms for the DFP is depicted 
in Table [V] and for the GSC in Table [VT] Since we did not 
consider the AVF technique for the GSC structure, we put its 
complexity for the DFP in both tables for comparison. 

For the DFP structure, we can say that the complexity of 
the proposed reduced-rank SG type and extended GS version 
algorithms increases linearly with rm. The parameter m is 
more influential since r is selected around a small range that 
is much less than m for large arrays. The complexity of the 
proposed reduced-rank RLS type and GS version algorithms is 
higher than that of the SG type and quadratic with m and r. For 
the GSC structure, the complexity of the SG type algorithms 
has extra m 2 terms as compared to the DFP structure in terms 
of additions and multiplications due to the blocking matrix 
in the sidelobe canceller. There is no significant difference in 
complexity of the RLS type algorithms due to the presence 
of the blocking matrix since (|29l l and ( f30b are recursive 
expressions and, as compared to non-recursive versions, reduce 
the complexity. 

In order to illustrate the main trends in what concerns the 
complexity of the proposed algorithms, we show in Fig. [3] 
and Fig. @] the complexity of both the DFP and the GSC 
structures in terms of additions and multiplications versus the 
length of the filter m. Since the complexity of the current 
algorithms according to the CMV criterion is a little less than 
that of the CCM criterion, we only plot the curves for the 
CCM criterion for simplification. Note that the values of r are 
different with respect to different algorithms, which are set to 
make the corresponding algorithms reach the best performance 
according to the experiments. The specific values are given in 
the figures. It is clear that the proposed SG type and extended 
GS version algorithms have a complexity slightly higher than 
the full-rank SG algorithm but much less than the existing 
algorithms based on the MSWF and the AVF techniques for 
both the DFP and the GSC structures. The curves of the 
proposed RLS type and GS version algorithms are situated 
between the full-rank RLS and the MSWF RLS algorithms in 
both figures. 



TABLE V 

Computational complexity of algorithms for DFP 



Algorithm 


Additions 


Multiplications 


I K-C IVl V-SCi 


3m. — 1 


4m + 1 


pd cfWA cn 

IK-vA Jyl-ou 


3m 


Arm _L *3 


rK-L-JVl V-KLc> 


4m 2 — m — 1 


5m 2 H- 5m — 1 


FR-CCM-RLS 


5m 2 + 2m — 1 


6m 2 + 6m + 3 


MSWF-CMV-SG 


rm 2 + (r + l)m 


9 i r> 

rm* + Arm 




+2r - 2 


+5r + 2 


MSWF-CCM-SG 


rm 2 + (r + l)m 


rm + Arm 




+4r — 2 


+4r + 4 


M S WF- CM V-RL S 


rm* + (r + l)m 


(r + l)m 2 + 2rm 




i A 9 . 1 i 

+4r^ — 3r — 1 


+5r 2 + 4r 


MSWF-CCM-RLS 


rm 2 + (r + l)m 


(r + l)m 2 + 2rm 




r 2 

+5r — r 


+6r 2 + 7r + 3 


AVF 


(a ir\ 2 i / -i\ 

(4r + 5)m + (r — l)m 


(5r + 8)m 2 




—2r — 1 


+ (3r + 2)m 


TTO.r'MV-'sfi 

J 1 V 7 v . . V 1 V O VJ 


A-WYY} 1 W) 1 7^v» Q 

'-ti lit 1 1 l ^ ' — <j 


A-Vtyi 1 *vk> 1 7t I 
'-t i lit ' * ' • ' i J 
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jio-ccm-sg 
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jio-ccm-gs 
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JIO-CMV-RLS 


4m 2 + (2r - l)m 


5m 2 + (3r + 3)m 




+4r 2 - 4r - 1 


+6r 2 + 4r 


JIO-CMV-RGS 


4m 2 + (5r - 3)m 


5m 2 + 6rm 




+4r 2 - 6r + 1 


6r 2 + 5r - 1 


JIO-CCM-RLS 


5m 2 + rm 


6m 2 + (2r + 6)m 




+5r 2 + 3r - 1 


+5r 2 + 9r + 3 


JIO-CCM-RGS 


5m 2 + (4r - 2)m 


6m 2 + (5r + 3)m 




5r 2 + r + 1 


5r 2 + lOr + 2 



TABLE VI 

Computational complexity of algorithms for GSC 



Algorithm 
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2 
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m 2 + m — 


1 




m 2 + 2m + 1 
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f i 
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f 2 
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2rm 


(r 
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(r 
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(r 
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• — l)m 




(5r + 8)m 2 
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+(3r + 2)m 
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3m 
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8)m 
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f 4 




+7r 2 + 3r + 2 
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f 8 




7r 2 + r + 4 
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4m 2 + (2r - 


7)m 




5m 2 + (2r — 4)m 




+5r 2 - 4r + 3 




+7r 2 + 2r + 1 
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4m 2 + (5r — 


9)m 




5m 2 + (5r - 7)m 




5r 2 - 9r 4 


- 7 




7r 2 + 3 



9 




FR-CCM-SG 
FR-CCM-RLS 
MSWF-CCM-SG (r-6) 
MSWF-CCM-RLS (r-7) 
AVF(r.7) 

JIO-CCM-SO (r.5) 
JIO-CCM-GS (r-5) 
JIO-CCM-RLS (r-5) 
JIO-CCM-RGS (r.5) 



12 16 20 24 28 32 36 40 
Number of Sensor Elements m 



4 8 12 16 20 24 28 32 36 40 
Number of Sensor Elements m 



Fig. 3. Complexity in terms of arithmetic operations versus the length of 
the filter m for the DFP structure. 
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Fig. 4. Complexity in terms of arithmetic operations versus the length of 
the filter m for the GSC structure. 



B. Analysis of the Optimization Problem 

Here, we present the analysis of the proposed reduced-rank 
scheme according to the CCM criterion, which depends on 
the transformation matrix and the reduced-rank weight vector. 
Our approach starts from the analysis of the full-rank constant 
modulus criterion and then utilizes the transformation matrix 
and the reduced-rank weight vector with the received vector 
to express the output. The constraint is enforced during the 
analysis. We will consider the analysis for both the DFP and 
the GSC structures. 

The full-rank constant modulus cost function in (f2]i with 
p = 2 and v = 1 can be written as 

J cm {w(i)) = E[|y(i)| 4 -2|y(i)| 2 + l] 

= E[\w H (i)x(i)x H {i)w(i)\ 2 ] (35) 
- 2E[\w H (i)x(i)\ 2 ] +1, 



where x(i) = J2 k =o C k d k (i)a(9 k ) + n(i) (k = 0, 



,q-D 



from (HJ with D k being the signal amplitude and d k is the 
transmitted bit of the fcth user. Note that we have replaced s k 
in (D by C k d k . 

For the sake of analysis, we will follow the assumption 
in llBTl and consider a noise free case. For small noise 
variance a 2 , this assumption can be considered as a small 
perturbation and the analysis will still be applicable. For 
large cr 2 , we remark that the term 7 can be adjusted for the 
analysis. Under this assumption, we write the received vector 
as x(i) = A(0)Cd(i), where A(0), as before, denotes the 
signature matrix, C(i) = diag[Co, . . . , C g _i] G C qxq , and 



d(i) = [d (i), 



lq-1 



(i)] r eC xl . 



For simplicity, we drop the time instant in the quantities. 
Letting q k = C k w H a(9 k ) and <j = [? , • • • , C 9 -i] T , we have 



J cm = Ek" dd" <;<;" dd" <;] - 2E\<; H dd H <;} + 1 



(36) 



Since d k are independent random variables, the evaluation 
of the first two terms in the brackets in (l36l reads 



S H dd H '« fl dd H \ 



9 -l g-l 

EE 1* 

fc=o ;=o 



\ 2 \di\ 2 <, k ^iQ, 



S H dd H <; = )r\d 

fc=0 



(37) 



2 * * 
fel ^k- 



For the reduced-rank scheme with the DFP structure, we 
have w = T r w. Thus, 

r 

S k = C k {T r w) H a{6 k ) = C k J2t%.a(6 k ), (38) 



where t,-. 



: mxl and tj (j 



1 , . . . , r) is the 



column vector of the transformation matrix T r . 

Given tj{9 k ) = C k t?.a(6 k ) and ^ = ££ =1 t 3 (9 k ), we get 

r r 

3a=EE*i(**)*»(**)- (39) 

j=l n=l 

From d38l and the constraint condition in (JTTJ, it is inter- 
esting to find c; = C07. Substituting this expression and 
into (l37T i. we have 



q-l 



<; H dd H <; = |d |%*?o + V \dkWtkk 



fc=i 



(40) 



\d \ 2 cfa 2 + <; H dd H <i, 



where § = k 1 ,...,^ 1 ] T g C<«- 1)xl and ~d = 

[4...,M T eC ( '" 1)xl . 

Substituting (l40l into d36l l. we get the CCM cost function 
in the form of reduced-rank, i.e., 



Jccm=E[|do|^ V+<r<id 



~,~M, 



-2E[\d \ 2 C -f 2 + ^ dd 3 + 1, 



(41) 



where £ is a function of the transformation matrix and the 
reduced-rank weight vector, as shown in (|38"V This expression 
is important for the reduced-rank CCM analysis. The fact that 
T r and w depend on each other and exchange information 
claims that we need to take both of them into consideration 
for the analysis. The expression in (1381 combines these two 
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quantities together and thus circumvents the complicated pro- 
cedures of performing the analysis separately. Note that (PTTT i 
is a constrained expression since the constraint condition has 
been enclosed in the first term of each bracket. 

We can examine the convexity of ( fTTT > by computing the 
Hessian matrix H with respect to ^ H and that is H = 



8^ 

H 



di 

2E 



d 1 2 Cl 7 2 - 1 ) dd" +i; n dd" ^dd 



~.~.H-~.~M ~~H 



dd"^ H dd" 
(42) J 

where H should be positive semi-definite to ensure the 
convexity of the optimization problem. The second and third 
terms in d42T > yield positive semi-definite matrices, while the 
first term provides the condition \do\ 2 C^ 2 — 1 > to ensure 
the convexity. Thus, J ccm is a convex function of T r and w 
when 



V 



> 



1 



(43) 



\do\ 2 Cf 

For the reduced-rank scheme with the GSC structure, the 
expression of the weight vector has been given in (0. Substi- 
tuting this expression into the definition of g. and considering 
the fact that Ba(6o) = 0, we obtain 



ft 



where t„,„ 



C a"(6 )a(6 ) 



for k = 



-C k J2 r j=1 t^.Ba(e k ) for * = !,.. 



,3-1 



(44) 



t gscJ £ C^" 1 )* 1 and i gscJ (j 



1, . . . , r) is the column vector of T gsc for the GSC structure. 

Given t 3l {8 k ) = 0";;^ (I = l,...,m), where 

a>n(8k) is the ith element of the steering vector with the direc- 
tion 9k and b; G c( m_1 ^ xl is the Zth column vector of the sig- 
nal blocking matrix B, we have 
Thus, for k = 1, . . . , q — 1, 



= E E E E (WnA *)- < 45 > 

j — 1 n— 1 /— 1 p— 1 

Substituting (l44l and d45l l into d37l ), we get the expression 
for the GSC structure, which is 



S H dd H <; 



9-1 

E 

fe=i 



|d | 2 C 2 a ff (0 o )a 7 (fl o )< (0 o )a(0 o ) + 5 H dd*S 



(46) 

which is in the same form as in (|40| | for the DFP structure 
but with the different expression of the quantity Using 
the similar interpretation for the DFP, the quantity in 
(t44t combines the transformation matrix and the reduced-rank 
weight vector together and thus simplifies the analysis. By 
computing the Hessian matrix H, we can obtain the same 
conclusion as shown in d43l . 

VI. Simulations 

In this section, we evaluate the output signal-to-interference- 
plus-noise ratio (SINR) performance of the proposed adaptive 
reduced-rank algorithms and compare them with the existing 



methods. Specifically, we compare the proposed SG and RLS 
type algorithms with the full-rank (FR) SG and RLS and 
reduced-rank methods based on the MSWF and the AVF 
techniques for both the DFP and the GSC structures. With 
respect to each algorithm, we consider the CMV and the 
CCM criteria for the beamformer design. We assume that the 
DOA of the desired user is known by the receiver. In each 
experiment, a total of K = 1000 runs are carried out to get 
the curves. For all simulations, the source power (including 
the desired user and interferers) is a 2 = a 2 = 1, the input 
signal-to-noise (SNR) ratio is SNR=10 dB with spatially and 
temporally white Gaussian noise, and 7=1. Simulations are 
performed by an ULA containing m = 32 sensor elements 
with half-wavelength interelement spacing. 

A. Comparison of CMV and CCM Based Algorithms 

In this part, we compare the proposed and existing al- 
gorithms according to the CMV and the CCM criteria for 
the DFP structure of the beamformer design. The simulation, 
which includes two experiments, shows the input SNR versus 
the output SINR. The input SNR is varied between -10 dB 
and 10 dB. The number of users is q = 5 with one desired user. 
Fig. (a) plots the curves of the SG type algorithms based 
on the full-rank, MSWF, AVF and the proposed reduced-rank 
scheme, and Fig. [5] (b) shows the corresponding RLS type al- 
gorithms. The parameters used to obtain these curves are given 
and the rank r is selected to optimize the algorithms. From 
Fig. [5] (a), the output SINR of all SG type methods increases 
following the increase of the input SNR. The algorithms based 
on the CCM beamformer design outperform those based on 
the CMV since the CCM criterion is a positive measure of 
the beamformer output deviating from a constant modulus, 
which provides more information than the CMV for the 
parameter estimation of constant modulus constellations. The 
proposed CCM algorithms achieve better performance than the 
existing full-rank, MSWF and AVF ones. By employing the 
GS technique to reformulate the transformation matrix, the GS 
version algorithms achieve improved performance. Fig. |5](b) 
verifies the same fact but for the RLS type algorithms. It is 
clear that the RLS type algorithms superior to the SG type 
ones for all input SNR values. 

This simulation verifies that the performance of the adaptive 
algorithms based on the CCM beamformer design has a similar 
trend but is better than that based on the CMV for constant 
modulus constellations. Considering this fact, we will only 
compare the CCM based adaptive algorithms in the following 
part for simplification. Note that all the methods in this 
simulation are for the DFP structure. The algorithms for the 
GSC structure show a similar performance, which is given in 
the next part. 

B. Output SINR for the DFP and the GSC 

We evaluate the output SINR performance of the proposed 
and existing algorithms against the number of snapshots for 
both the DFP and the GSC structures in Fig. [6] and Fig. [7] 
respectively. The number of snapshots is N = 500. In Fig. 
[6] the convergence of the proposed SG type and extended 
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Fig. 5. Output SINR versus input SNR with m = 32, q = 5, SNR= 10 dB, 
(a) fi Tr = 0.002, = 0.001, r = 5 for SG, fi Tr = 0.003, = 0.0007, 
r = 5 for GS; (b) a = 0.998, 5 = 5 = 0.03, r = 5 for RLS, a = 0.998, 
5 = 5 = 0.028, r = 5 for RGS of the proposed CCM reduced-rank scheme. 



Fig. 6. Output SINR versus the number of snapshots with m = 32, q = 7, 
SNR= 10 dB, fi Tr = 0.003, /Mb = 0.003, r = 5 for SG, /j/r r = 0.0023, 
/i™ = 0.003, r = 5 for GS, a = 0.998, 5 = 5 = 0.025, r = 5 for RLS, 
a = 0.998, <5 = <5 = 0.02, r = 5 for RGS of the DFP structure. 



GS version algorithms is close to the RLS type algorithm 
based on the MSWF, and the output SINR values are higher 
than other SG type methods based on the full-rank, MSWF 
and AVF. The convergence of the proposed RLS type and 
GS version algorithms is slightly slower than the AVF, but 
much faster than other existing and proposed methods. Its 
tracking performance outperforms the MSWF and AVF based 
algorithms. 

Fig. [7] is carried out for the GSC structure under the same 
scenario as in Fig. [6] The curves of the considered algorithms 
for the GSC show nearly the same convergence and tracking 
performance as those for the DFR It implies that the GSC 
structure is an alternative way for the CCM beamformer 
design. The difference is that the GSC processor incorporates 
the constraint in the structure and thus converts the constrained 
optimization problem into an unconstrained one. The adaptive 
implementation of the GSC beamformer design is different 
from that of the DFP but the performance is similar. The 
following simulations are carried out for the DFP structure 
to simplify the presentation. 

C. Mean Square Estimation Error of the Weight Solution 

In Fig. [8] we measure the mean square estimation error 
E{||io(i) — u>mvdi-|| 2 } between the weight solutions (full-rank) 
of the proposed methods w(i) = T r (i)w(i) and that of the 
minimum-variance-distortionless-response (MVDR) method 

10 ™ m vdr = 7 n ff(f )H-in(e ) » where R is obtained by its 
sample-average estimation. The experiment is carried out with 
the same scenario as in Fig [6] It exhibits that the mean square 
estimation error decreases following the snapshots. The values 
of the proposed SG and RLS type algorithms decrease rapidly 
and reach a relative lower level compared with those of the 
existing methods. Note that tu mv dr is n °t an optimum solution 
for the proposed algorithms but viewed as a referenced weight 
solution since, for the CCM based algorithm, the weight 




-O- JIO-CCM-RGS t 

200 400 600 800 1000 

Number of Snapshots 



Fig. 7. Output SINR versus input SNR with m = 32, q = 7, SNR= 10 dB, 
ii TT = 0.0025, /i„ gsc = 0.002, r =_ 5 for SG, /x Tr = 0.003, ^ gsc = 0.003, 
r = 5 for GS, a = 0.998, 5 = 5 = 0.01, r = 5 for RLS, a = 0.998, 
5 = 5 = 0.0093, r = 5 for RGS of the GSC structure. 



expression is not a pure function of the received data but also 
depends on the previous weighting values. 

D. Output SINR versus Rank r and Automatic Rank Selection 

In the next two experiments, we assess the output SINR 
performance of the proposed and analyzed algorithms versus 
their associated rank r, and check the effectiveness of the 
automatic rank selection technique. The experiment in Fig. 
[9] is intended for setting the adequate rank r of the reduced- 
rank schemes for a given input SNR and number of snapshots. 
The scenario is the same as that in Fig. [6] except that the 
number of snapshots is fixed to be N — 500 and the rank r 
is varied between 1 and 16. The result indicates that the best 
performance of the proposed SG and RLS type algorithms 
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Fig. 8. Square estimation error between the weight solution and the MVDR Fig. 9. Output SINR versus rank r with m = 32, q = 7, SNR= 10 dB. 
weight solution. 



is obtained with rank r = 5 for the proposed reduced-rank 
scheme. The performance of the full-rank methods is invariant 
with the change of the rank r. For the MSWF technique, its 
SG and RLS type algorithms achieve their best performance 
with ranks r = 6 and r = 7, respectively. For the AVF-based 
algorithm, the best rank is found to be r = 7. It is interesting to 
note that the best r is usually much smaller than the number of 
elements m, which leads to significant computational savings. 
For the proposed and analyzed algorithms, the range of r that 
has the best performance is concentrated between r m j n = 3 
and r max = 7. This range is used in the next experiment to 
check the performance of the proposed algorithms with the 
automatic rank selection technique. 

Since the performance of the proposed reduced-rank algo- 
rithms was found in our studies to be a function of the rank r 
and other parameters such as the step size and the forgetting 
factor, we need to consider their impacts on the performance 
of the system. Specifically, we assume that the step size of 
the SG type algorithms and the forgetting vector of the RLS 
type algorithms are adequately chosen and we focus on the 
developed automatic rank selection technique introduced in 
the previous section. 

In Fig. [lOl the proposed reduced-rank algorithms utilize 
fixed values for their rank and also the automatic rank selection 
technique. We consider the presence of q = 10 users (one 
desired) in the system. The results show that with a lower rank 
r = 3 the reduced-rank algorithms usually converge faster but 
achieve lower output SINR values. Conversely, with a higher 
rank r = 7 the proposed algorithms converge relatively slower 
than with a lower rank but reach higher output SINR values. 
The developed automatic rank selection technique allows 
the proposed algorithms to circumvent the tradeoff between 
convergence and steady-state performance for a given rank, 
by adaptively choosing the best rank for a given number of 
snapshots which provides both fast convergence and improved 
tracking performance. 
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Fig. 10. Output SINR versus the number of snapshots with m = 32, q = 10, 
SNR= 10 dB, (a) /j, Tr = 0.003, = 0.004 for SG, nr r = 0.003, 
fiw = 0.001 for GS; (b) a = 0.998, 5 = 5 = 0.03 for RLS, "a = 0.998, 
5 = 5 = 0.026, r = 5 for RGS with the automatic rank selection technique. 



E. Performance in non-stationary scenarios 

In the last experiment, we evaluate the performance of 
the proposed and analyzed algorithms in a non-stationary 
scenario, namely, when the number of users changes. The 
automatic rank selection technique is employed, and the step 
size and the forgetting factor are set to ensure that the 
considered algorithms converge quickly to the steady-state. In 
this experiment, the scenario starts with q = 8 users including 
one desired user. From the first stage (first 500 snapshots) 
of Fig. QT1 the convergence and steady-state performance of 
the proposed SG type algorithms is superior to other SG type 
methods with the full-rank, MSWF and AVF. The proposed 
RLS type algorithm has a convergence rate a little slower than 
the AVF but faster than the other analyzed methods, and the 
steady-state performance better than the existing ones. Three 
more interferers enter the system at time instant i = 500. This 
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change makes the output SINR reduce suddenly and degrades 
the performance of all methods. The proposed SG and RLS 
type algorithms keep faster convergence and better steady- 
state performance in comparison with the corresponding SG 
and RLS type methods based on the full-rank and MSWF 
techniques. The convergence of the AVF method is fast but the 
steady-state performance is inferior to the proposed methods. 
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Fig. 11. Output SINR versus input SNR with m = 32, q 1 = 8, 52 = 11, 
SNR= 10 dB, ii Tr = 0.003, = 0.0038, r = 5_for SG, fi Tr = 0.003, 
IMs = 0.001, r = 5 for GS, a = 0.998, 5 = S = 0.033, r = 5 for 
RLS, a = 0.998, 5 = S = 0.028, r = 5 for RGS of the proposed CCM 
reduced-rank scheme. 



VII. Concluding Remarks 

We proposed a CCM reduced-rank scheme based on the 
joint iterative optimization of adaptive filters for beamformer 
design. In the proposed scheme, the dimension of the received 
vector is reduced by the adaptive transformation matrix that 
is formed by a bank of full-rank adaptive filters, and the 
transformed received vector is processed by the reduced-rank 
adaptive filter for estimating the desired signal. The proposed 
scheme was developed for both DFP and GSC structures. We 
derived the CCM expressions for the transformation matrix 
and the reduced-rank weight vector, and developed SG and 
RLS type algorithms for their efficient implementation. The 
GS technique was employed in the proposed algorithms to 
reformulate the transformation matrix and thus improve the 
performance. The automatic rank selection technique was de- 
veloped to determine the most adequate rank and make a good 
trade-off between the convergence rate and the steady-state 
performance for the proposed methods. The complexity and 
convexity analysis of the proposed algorithms was carried out. 
Simulation results for the beamforming application showed 
that the proposed reduced-rank algorithms significantly out- 
perform the existing full-rank and reduced-rank methods in 
convergence and steady-state performance at comparable com- 
plexity. 



Appendix 

Derivation of d24l) 

In this appendix, we show the details of the derivation of 
the expression for the transformation matrix in (133V Assuming 
w(i) ^ is known, taking the gradient terms of (l3Zt with 
respect to T r {i), we get 

i 

vx unTrW = 2Y,\y{i)\Mi)* H {i)T r (i)w{i)w H (i) 

1=1 

i 

- 2^x{l)x H (l)T r (i)w{i)w H {i) + 2\a{8 )w H (i) 
1=1 

= 2R{i)T r {i)w(i)w H (i) - 2p(i)w H (i) 
+ 2\a(6 )w H (i). 

(47) 

Making the above gradient terms equal to the zero matrix, 
right-multiplying the both sides by w(i), and rearranging the 
expression, it becomes 



T r (i)w(i)=R (i)[p(i) - Xa(9 )] 



(48) 



If we define pj^{i) = R (i) [p{i) — Aa(6* )], the solution 
of T r (i) in (l48b can be regarded to find the solution to the 
linear equation 

T r {i)w(i) = p k {i) (49) 

Given a w(i) ^ 0, there exists multiple T r (i) satisfying 
( |49l > in general. Therefore, we derive the minimum Frobenius- 
norm solution for stability. Let us express the quantities 
involved in d49l by 



T r (i) 



Pi(i)' 

pS) 

Pm{i) 



(50) 



The search for the minimum Frobenius-norm solution of 
( |49l > is reduced to the following m subproblems (j = 
1, . . . , to): 

mm l|p,-MH 2 subject to p-{i)w{i) =p R)j (i), (51) 

The solution to (l5TT l is the projection of p(i) onto the 
hyperplane Hj(i) = {p{i) 6 C lxr : p(i)w(i) = P RJ (i)}, 
which is given by 

w H (i) 



Pj' 1 - l'n., [ '^ 



w(i) 



(52) 



Hence, the minimum Frobenius-norm solution of the trans- 
formation matrix is given by 



T r (i)=P R ( 



w H (i) 



Substituting the definition of p R (i) into ([53), we have 



(53) 



T r (i)=R (i)[p(t) - \a(0o) 



w H (i) 

WW 2 



(54) 
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The multiplier A can be obtained by incorporating (|48l l with 
the constraint w H (i)T^ (i)a(8o) = 7, which is 



A 



p(i)R (i)a(0 o )-7 



(55) 



a"(6 Q )R (i)a(0 o ) 
Therefore, the expression of the transformation matrix in 
d33l l can be obtained by substituting ( T55l l into J54t . 
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